# Multimodal audio processing

Kimi-Audio is an open-source audio foundation model that excels at audio understanding, generation, and dialogue.

Speech Recognition, Supports Multiple Languages

Pathumma Llm Audio 1.0.0

Pathumma-llm-audio-1.0.0 is an 8-billion-parameter Thai large language model specifically designed for audio comprehension tasks, capable of processing various audio inputs including speech, general audio, and music.

Transformers, Supports Multiple Languages

AIbase

Empowering the Future, Your AI Solution Knowledge Base

© 2025 AIbase